Probabilistic Models and Informative Subspaces for Audiovisual Correspondence

Authors

  • John W. Fisher
  • Trevor Darrell
Abstract

We propose a probabilistic model of single-source multimodal generation and show how algorithms for maximizing mutual information can find the correspondences between components of each signal. We show how non-parametric techniques for finding informative subspaces can capture the complex statistical relationship between signals in different modalities. We extend a previous technique for finding informative subspaces to include new priors on the projection weights, yielding more robust results. Applied to human speakers, our model can find the relationship between audio speech and video of facial motion, and partially segment out background events in both channels. We present new results on the problem of audio-visual verification, and show how the audio and video of a speaker can be matched even when no prior model of the speaker’s voice or appearance is available.
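
The abstract gives no algorithmic detail, so the following is only a minimal sketch of the underlying idea, not the authors' method. It uses synthetic data and a simple histogram (plug-in) mutual information estimator as a stand-in for the non-parametric techniques the abstract mentions. The names `histogram_mi`, `audio`, and `video`, the noise levels, and the fixed projection weights are all hypothetical choices made for illustration. The sketch compares the estimated mutual information of a projection pair that picks up a shared source against a pair that picks up only independent background noise.

```python
import numpy as np


def histogram_mi(x, y, bins=16):
    """Plug-in estimate of mutual information (in nats) between two 1-D samples."""
    joint, _, _ = np.histogram2d(x, y, bins=bins)
    pxy = joint / joint.sum()             # joint probability table
    px = pxy.sum(axis=1, keepdims=True)   # marginal of x, shape (bins, 1)
    py = pxy.sum(axis=0, keepdims=True)   # marginal of y, shape (1, bins)
    nz = pxy > 0                          # skip empty cells to avoid log(0)
    return float(np.sum(pxy[nz] * np.log(pxy[nz] / (px @ py)[nz])))


rng = np.random.default_rng(0)
n = 5000

# Hypothetical single source that drives one component of each modality.
source = rng.standard_normal(n)

# Synthetic 2-D "audio" and "video" features (assumptions, not real data):
# one column depends on the shared source, the other is independent background.
audio = np.column_stack([source + 0.3 * rng.standard_normal(n),
                         rng.standard_normal(n)])
video = np.column_stack([rng.standard_normal(n),
                         np.tanh(source) + 0.3 * rng.standard_normal(n)])

# Projection weights fixed by hand here; a learning procedure would instead
# search for the weights that maximize the estimated mutual information.
mi_informative = histogram_mi(audio @ np.array([1.0, 0.0]),
                              video @ np.array([0.0, 1.0]))
mi_background = histogram_mi(audio @ np.array([0.0, 1.0]),
                             video @ np.array([1.0, 0.0]))

print(f"MI along source-driven projections: {mi_informative:.3f} nats")
print(f"MI along background projections:    {mi_background:.3f} nats")
```

The source-driven pair should score clearly higher than the background pair, even though the video component is a nonlinear function of the source. That gap is what a mutual information criterion can exploit when choosing projection weights.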

Similar resources

Shift Invariant Spaces and Shift Preserving Operators on Locally Compact Abelian Groups

We investigate shift invariant subspaces of $L^2(G)$, where $G$ is a locally compact abelian group. We show that every shift invariant space can be decomposed as an orthogonal sum of spaces each of which is generated by a single function whose shifts form a Parseval frame. For a second countable locally compact abelian group $G$ we prove a useful Hilbert space isomorphism, introduce range funct...

A Review of the Draft WIPO Treaty on the Protection of Audiovisual Performances

The Rome Convention of 1961 protected performers for the first time, but its Article 19 limited protection for audiovisual performances. The proposed draft of the WPPT extended performers' protection to audiovisual performances, but audiovisual performances were excluded from the WPPT as adopted. A meeting of WIPO members and unions was therefore held in March ...

Audiovisual Programs As Sources Of Language Input: An Overview

Audiovisual devices such as satellite and conventional televisions can offer easy access to authentic programs which are considered to be a rich source of language input for SLA (Second Language Acquisition). The immediacy of various audiovisual programs ensures that language learners’ exposure is up-to-date and embedded in the real world of native speakers. In the same line, in the present pap...

Meaning-Focused Audiovisual Feedback and EFL Writing Motivation

Given how challenging writing in a foreign language is, this research provided Iranian EFL learners with audiovisual feedback as an alternative to conventional written feedback, and focused on its meaningfulness in order to give participants an engaging medium and increase their motivation. One hundred young adult female learners in addition to six English languag...

Publication date: 2002